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Abstract 

The author uses a Stein-type covariance identity to obtain moment estimators for 
^yy • the parameters of the quadratic polynomial subfamily of Pearson distributions. 
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The asymptotic distribution of the estimators is obtained, and normality and 
symmetry tests based on it are provided. Simulation is used to compare the 
performance of the proposed tests with that of other existing tests for symmetry 
and normality. 
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1 Introduction 

Let X be a continuous random variable (r.v.) with probability density function (p.d.f.) 
/ and finite mean /i. We say that / has a Pearson quadratic form (see [IS]), q(x) = 
S(x — n) 2 + f3(x — fi) + 7, if it satisfies the identity 

X : / (/J, - t)f(t)dt = q{x)f(x) for all x E E. (1.1) 



Remark 1.1. Let X be a continuous r.v. which satisfies (11.11) . Then the support of X 
is the interval 

J(X) = {x : f(x) > 0} = (essinf(X), esssup(AT)) = (a, to), 

where essinf(X) = mf x( zR{F(x) > 0} and esssup(X) = sup xg]R {F(x) < 1}. Also, it is 
obvious that f e C°°((at, uj)). 

The distributions satisfying (II. ip belong to the Pearson family. Moreover, the quadratic 
q generates the orthogonal polynomials through the Rodrigues-type formula, see [PT] . 

(-l) n d n 

^(x) = ^^^(x)/(x)], xeJ(X). 
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Similarly, if X is a discrete (integer-valued) r.v. with probability mass function 
(p.m.f.) p and finite mean p, we say that p has a Pearson quadratic form q(j) = 
S(j — ji) 2 + f3(j — p) +7 if it satisfies the identity 

J2^- k )P( k ) = ( l(j)P(j) for all j G Z. (1.2) 

Remark 1.2. Under (11. 2\ it can be shown that the support J(X) = {j 6 Z : p(j) > 0} 
is an interval of integers, i.e., if ji G J{X) and j 2 G J{X) then all integers j between 
ji and j 2 belong to J(X). The cases J(X) = {j } or J(X) = {jo, jo + 1} are trivial, 
because identity (II. 2p is always satisfied and q is not uniquely defined. We exclude 
these trivial cases from what follows. Thus, when we say that an integer-valued r.v. 
has a quadratic polynomial, we will assume that \ J(X)\ > 3. 

For a suitable function, g, defined on J{X), the following covariance identity holds 
(see [21 El [131 [IT]; cf. [221 [23]): 

E[(X-p)g(X)] = E[q(X)g'(X)], (cont. case) (1.3) 

E[(X-p)g(X)] = E[q(X)Ag(X)], (discr. case) (1.4) 

where A denotes the forward difference operator, Ag(x) = g(x + 1) — g{x). 

Furthermore, using the Mohr and Noll inequality (or the discrete Mohr and Noll 
inequality), one obtains Poincare-type lower/upper bounds for the variance of g(X) 
(see [U El H31 US]) of the form 

(-l) n [S n - Var^(X)] > 0, where 
S n -E tT— T^h] (9 {k+1 \X)) 2 ], (cont. case) 



or 



Sn = T, ,u M JJ )k n ^ lk+1] W (A fc+1 ^(X)) 2 ], (discr. case) 

where A k+1 g(x) = A (A k g(x)) , k = 1, 2, . . . , with A = /. 

It should be noted that the quadratic q also appears in variance bounds obtained 
using Bessel's inequality (see [2]). 

Clearly, if we know the mean p and the parameters 5, (3, 7 we can solve equation 
( ITU (or (TOD) for / (or p) (cf. ^ QSJ QU ESI E2D] ) • 

The purpose of the present paper is to obtain an estimator for the parameters of the 
quadratic q, i.e., for the vector (p, 6, (3, 7)*. With the help of (11. 3p . (11 .4p . we generate 
a system of equations, from which we obtain the moment estimators for p, S, (3, 7. 

Employing the delta-method, the asymptotic distribution of the estimators is de- 
rived. Some applications are also given. Similar work has been done by Pewsey, [18], 
who found the joint asymptotic distribution of the sample mean, variance, skewness 
and kurtosis. It is worth mentioning that Pewsey's results provide, for the first time, 
the joint asymptotic distribution for these fundamental statistics. 
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2 Moment Estimators 

Here we deal with the estimation of the parameters 8, /3, 7 using the method of moments. 
ML estimation is possible but, as is generally true for all but the most simple of 
distributions, there are no closed-form expressions for the MLEs and ML estimation 
reduces to a numerical optimization problem. Instead, in what follows we consider 
estimators obtained using the method of moments. 

Let X be a r.v. with EX 4 < 00 and EX = /i. If [i k = E(X — jj) k is the k-th central 
moment, then 

/i 4 ^2 - 14 - iA > °> (2- 1 ) 

and the equality holds only for the trivial case where X takes, with probability 1 
(w.p. 1), at most two values [the r.v. Y = ii 3 (X — ji) 2 — (yU 4 — fi 2 )(X — /i) has vari- 
ance VarF = (yU 4 — \£f){\i±ii 2 — ji\ — > 0, since Var(X — fi) 2 = /i 4 — fi 2 > 0] . 
Next, let X n = . . . ,X n ) be a random sample from any distribution. If m^ n = 

Sj=o \Xj ~ Xn) /n is the k-th sample central moment, then 

m iyn m 2 -n ~ - m\. n > 0, (2.2) 

and the equality holds if and only if we have observed at most two values in the sample 
(this follows directly from (12. ip ). 

Theorem 2.1. Let X be an integer-valued r.v. (or a continuous r.v.) with mean \i, 
finite fourth moment and p.m.f. p satisfying (II. 2p (or p.d.f. f satisfying (II- ip ). with 
q(j) = 5(j — ji) 2 + (3(j — fi) + 7. // X n = (X\, . . . , X n ) is a random sample from X , 
with at least three different values (or at least three values), then 

(a) for the integer-valued case, the moment estimators for 5,(3, 7 are 

5 n = (2m 4;n m 2;n - 3m 2 3 . n - 6m| n + m%.^) / QO n , 
dn = m 3 . n (l - 26 n ^ - m 2;n , /2m 2;n 

= (m 4;n m 3;n - 3m 4;n m 2;n + 3m 2 3 . n + 3m 3 . n m 2 2 . n - m 3 . n m 2 . n + Sml. n )/60 n , 
In = m 2 ;n (l - 8n) = (4m 4;ri m2 ;n - 3m\. n m 2 -n - m% n )/Q0 n , 

(b) for the continuous case, the moment estimators for 5, /?, 7 are 

8 n = (2m 4 ; n m 2;n - 3m 2 3 . n - Qml. n )/W n , 

dn = m 3 . n (l - 25 n J /2m 2 . n = (m 4 . n m 3;n + 3m 3;n m| ri )/60 n , 

7n = m 2]n (l - 5 n ^j = (Am 4]n m 2 2 . n - 3m 2 3 . n m 2 . n ) /§O n , 

where O n = m^ n m 2 . n — m 3 . n — m 2 . n is a positive number, X n is the sample mean and 
m k-,n = Y^j=i {Xj — X n ) /n is the k-th sample central moment. Also, the estimators 
bnifimln converge strongly to 5,(3,^, respectively. 
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Proof: (a) Since X has finite fourth moment, it follows that it has finite central 
moments up to the fourth order. Also, its p.m.f. is quadratic q and so the covariance 
identity (jl.4p applies to any suitable g. In particular, for g(x) = (x — fi) k+1 , k = 0, 1, 2, 
the covariance identity is satisfied. For these functions, and since A(x — fi) = 1, 
A(x - /i) 2 = 2{x - fi) + 1, A(x - /i) 3 = 3(x - /j,) 2 + 3(x - fi) + 1, yields the 

equalities, 

/i 2 5 + 7 = /i 2 , 
(2fj, 3 + fj, 2 )S + 2^ 2 /3 + 7 = /i 3 , 

(3/i 4 + 3/i 3 + /i 2 )5 + (3/i 3 + 3/i 2 )/3 + (3/i 2 + 1)7 = /i 4 . 
Solving this system of equations, we obtain 

5 = (2/i 4/ u 2 — 3^3 — 6/i 2 + ^2) / 66*, 

/3 = [fi 3 (l - 26) - /i 2 ]/2/i 2 = (/i 4 /i 3 - 3/i 4 /i 2 + 3/ig + 3/i 3 /i 2 - fi 3 fi 2 + 3^)/60, 

7 = /i 2 (i - 5) = (fitui - 14^2 - 14) 1 60 > 

where = /i 4 /i 2 — /x| — For the solution of this system it is necessary to have 
0^0. This follows directly from (12.11) . because J(X) > 3 (see Remark 1 1.21) . 

If in a random sample we have observed at least three different values, then n > 0, by 
(12. 2p . Replacing irik-n by in the above expressions we obtain the moment estimators 
5 n , (3 n and %. 

(b) Using similar arguments we observe that for g{x) — (x — fi) k+1 , k = 0,1,2, the 
covariance identity (11.31) is satisfied, which is 

/i fc+2 — (k + 1)E [q{X) (X - /i) fc ] = (k + 1) [/W + + ^kl] ■ (2.3) 

From (12.31) . with k = 0, 1,2, we generate a system of equations. Solving this system 
we obtain 

S = (2/i 2 /i 4 — 3/i 2 — 6/i 2 ) / 60, 

13 = /i 3 (l - 25)/2/x 2 = (^ 3 + 3/i 3y u 2 )/66), 

7 = /i 2 (l - 5) = (4/i 4 /i 2 - 3/i3/i 2 ) / 66>, 

where = yU 4 /i 2 — ji 2 — /i 2 . For the solution of this system we have to assure that 
0^0. This follows directly from (12. ip . since the r.v. X is continuous. 
Since a random sample of at least three values from a continuous distribution function 
consists of distinct values (w.p. 1), we have n > 0, a.s. 

For all k = 1, 2, 3, 4 the r.v. X k has finite mean \J k = ~EX k , and it is well known that 
m' k . n = Y^j=iXj /n — > /i' k , a.s. Finally, the estimators 5 n , /3 n ,j n can be written as 
rational functions of m' k , n , k = 1,2,3,4, and we conclude, using Slutsky's Theorem, 
that these functions converge strongly to 5,f3,j respectively. □ 
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Remark 2.1. If we carefully examine the expressions for 8, /?, 7 in the continuous case, 
we will see that 5 is a number that does not have "measurement units" (m.u. 's), f3 is 
measured using the m.u. 's of X and 7 is measured using the m.u. 's of the square of 
X . Bearing in mind that 5, (3, 7 are multiplied by (x — /1) 2 , (x — /i), 1 respectively, (i.e. 
q(x) = 5(x — fi) 2 + f3(x — /t) + j) we expect the final result to be measured in m.u. 's 
of the square of X. This indicates that the above choice of estimators is natural (see 



3 Asymptotic Distribution 

Next, we study the asymptotic distribution of the estimators using the delta-method: 
Let T n be a sequence of r.v. 's in R ; i?G0C R fc , and assume that 

s/n{T n - ■&) N k (0, $), as n 00, 

where Nk(0, is a k-dimensional normal distribution with mean vector and covari- 
ance matrix ^1. 

If 4> : R fe — > R m is (totally) differentiable in "& G 0, with total differential J ( p('&) = 
(dM x )/dxj)\ x= ^ G R mxfc , then (see [25], p. 25) 

V^(0(T n ) - 0(0)) A N m (0, J^(0)$jj(0)), as n^oo. 
So, we can easily deduce the following result. 

Theorem 3.1. Let X n = (Xi,...,X n ) be a random sample, with at least three dis- 
tinct values (or at least three values), from an integer-valued (or continuous) r.v. 
X, with p.m.f. satisfying (11.21) (or p.d.f. satisfying (11.11) ) and finite EX 8 . // q n = 
(X n ,5 n ,(3 n ,%) , with 5 n ,/3 n ,% as in Theorem ^V^a) (or (b)), and to = (fi, 5, /3, 7)*, 
then 

y/n(q n — <jl>) -—y iV 4 (0, D), as n — > 00, 

where D = (J v (&) J^))^,(J tp {'&) J^. (/*))* V = (0, M2, M3, A^)* & = (j", A*2, A*3, A^)*, $ = 
(fii+j - lHHj)ij (with fix = 0), J (/x) has elements jf 4 = I, jf 2 = 0, jf 3 = 0, jf 4 = 0, 

321 = °> 322 = 1; J23 = 0, 324 = °; Jsi = ~3^2, jf 2 = 0, J33 = 1, J34 = 0, jf x = -4// 3 , 

J42 = 0? J43 = 0; J44 = 1; anc ^ 66> 2 J ¥ ,(i9) (with = fJL$fi2 — /i 2 — fi\) has elements: 
(a) for the discrete case, jf t = 60 2 , jf 2 = 0, jf 3 = 0, jf 4 = 0, = 0, j% 2 = 

H/4 - 8 A*4A*2 + ^4^2 + 9/^2 ~ 2/^2 + (4> 323 = -2/i 4 /i3^2 - 6// 3 £t| + 2/i 3 /i|, = 
A*iA*2 + 4 /4 - a4 331 = 0, = ~fl\fl3 + 6^4/^2 ~ 6A«3/X2 + ^3 + 3 A^/4 ~ A*3A*2> 

J33 = /^4/ i 2+/i4/X3 + 2/i 4 /il-/i4/il + 3/i^- ^/i2-3/i^+/i^ j| 4 = -/if - Afl^\ + /I 3 f4, 

jti = o, & = /4i4- 2^/4^ +Hit4- 2 f^t4+f4- 2 i4i4 +3/4/4, 3^ = 2 ^/4~ 2 ^/4, 

J44 = ~t L 2 + / i 2> 
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(b) for the continuous case, jf x = GO 2 , jf 2 = 0, jf 3 = 0, jf 4 = 0, jt\ = °; J22 = 

IHf4 ~ 8^4/4 + 9f4t4> J23 = -2/i 4 /i3^2 - 6/i 3 /4 J24 = l4t*2 + *l4> 3$L = Q > J32 = 
-A*4A*3 + 6^4A*3^-6^1a*2 + 3a«3|4, 333 = A*I>2 +1^4/4 + ^l4 + 3 / i i/ i 2 ~ 3 /4 J34 = -A*3~ 
4/U3M2; Oil = °> J42 = 4^4/^2 ~ A*2 + 4// 4 /4 + 3 /4 - 6/*1a*2> ^43 = 2 /M/^| + 6^3/4; 

j'44 = -MlMi-4/x|. 

Proof: We centralize the Xj- values as lj = Xj — fi, for j = 1, . . . , n. Then for the 
vector Y n = (Y n , Y 2 , F n 3 , Y^f , it is well known that 

\fn(Y n — (j,) -^4 ^4(0, ^1), as n — > 00. 

We consider the sample central moments m k . n = YTj=i {Xj ~ / n, k = 2,3,4 

and we seek the asymptotic distribution of the vector m n = (y n ,m 2;Il ,m3. n ,m4.„) t . 
Observe that 

- fc ;n = B-i) i ( fc )>r I (^) J - 

i=0 ^ 1 ' 

Hence, for k = 2, 3, 4, we get m 2 , n =Y% - {Y n )\ m 3;n = Y* - 3Y 2 (F n ) + 2 (F n ,) 3 , 
m4 ; „ = — 4F, 3 (^n.) + GY 2 (V n ) — 3 (y n ) . Thus, the vector m n can be written as 
m n = ^(F n ,r n 2 ,y n 3 ,r n 4 ), where if>(x) = (^i(aj), ^(aj), ^(a;), M*))* with ^x(aj) = 
xi, ip2(x) = £2 — ^(aO = £3 — 3x2Xi + 2xf, ^(x) = X4 — 4x3X1 + 6X2X 2 — 3xf. 

Applying the cfeZta-method, it follows that ^Jn(m n — /x) -^4 N4 (0, J^(/x)$J^(/x)), as 
n ->■ 00, where i^UO = {dtpi/dxj) | . 

Since T n = (X„, m 2; „, m 3;n , m 4;n ) t = ra n + (/i, 0, 0, 0)* (and = /x + (/i, 0, 0, 0)*), we 
obtain y/n(T n — $) -^4 (0, J^,(/Lt)$Jj,(/u)), as n — > 00. 

Regarding the asymptotic distribution of the vector ^Jn{q n — c*>), we have q n = <p(T n ) 
and w = where <£>(cc) = (<£>i(cc), ^2(2;), ^3(2;), ^(a:)) an d the coordinates <£>i(x) 

are given by: 

(a) for the integer-valued case, <fi(x) = xi, ^2(2:) = (2x4X2 — 3x 2 — 6x3 + x 2 )/c(x), 
^3(2:) = (X4X3 — 3x4X2 + 3x| + 3x3X3 — X3X2 + 3x|)/c(cc), ^4(2:) = (X4X 2 — X3X2 — 
xl)/c(x), 

(b) for the continuous case, y?i(a:) = xi, ^2(2;) = (2x4X2 — 3xg — 6x3) /c(x), ^3(2:) = 
(x 4 x 3 +3x3X2) /c(x), ip 4 (x) = (4x4X2 - 3xlx 2 )/c(x), 

with c(x) = 6(x4X2 — x\ — X2). Thus, 

v^(q„ - w) A iV 4 (0, (^(^^(/x))^ J^J^fj,)) , as n 00, 
where J^'d) = (dipi/dxj) \ _ „ for both cases. □ 
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4 Hypothesis Testing 



In the subsections that follow we present various hypothesis tests based on the asymp- 
totic distribution of the parameter estimates. 

4.1 Continuous Case 
4.1.1 Test for Normality 

A test of normality is equivalent to testing 

H : 5 = (3 = vs H i : At least one of 5 or (3 is non-zero. 

TheoremO^) shows that ^(5 n -5, (3 n -f3) iV 2 (0, B s ,p), as n — > oo, where = 
{AJ V (&) J^(/j,))^i(AJ v ('d) J^{fJ,))\ with A = (g J § g). Thus, under null hypothesis, we 
have that 

Q n = n(5 n , f3 n ,)Bjp. (5 n , /3 n Y — > xL as n ^ °°> 

where = ( 2 ^ 3 3ct 2/ 2 ) and Xm is the chi-square distribution with m degrees of free- 

dom. Since B>s,p-,o is unknown we estimate it by = ( 2 ^ 3 3s 2/ 2 ), replacing a 2 by 

s 2 = ^j-m 2;r i. For testing the above hypothesis, we propose the statistic 

q n = 7l(S n , fl n , )D 5 ,/3;0 (S n , P n )\ 

and, at significance level a, the asymptotic rejection region is R = {q n > X2 ;a }) where 
Xm-a is the upper 100a% point of the xL distribution. 

The distribution of q n is asymptotically xf,. Table 1 contains the 90th, 95th, 97.5th 
and 99th percentiles of the empirical distribution of q n generated by simulation of 10 5 
samples of size n = 10,20,30,50,70,100,150,200,300,400,500,750 and 1000 from a 
normal distribution. 

Table 1. Empirical percentiles of the distribution of q n for random samples of size n drawn from a 
normal distribution. 



n 


10 


20 


30 


50 


70 


100 


150 


200 


300 


400 


500 


750 


1000 


oo 


-Po.90 


17.26 


9.18 


7.31 


6.06 


5.53 


5.20 


4.91 


4.85 


4.75 


4.66 


4.67 


4.63 


4.62 


4.60 


-Po.95 


27.48 


13.35 


10.37 


8.36 


7.56 


7.03 


6.53 


6.40 


6.23 


6.14 


6.11 


6.06 


6.03 


5.99 


-Fb.975 


41.25 


18.24 


13.83 


10.94 


9.75 


8.99 


8.23 


8.03 


7.83 


7.61 


7.58 


7.54 


7.47 


7.38 


-Po.99 


67.70 


26.03 


19.04 


14.52 


12.77 


11.78 


10.69 


10.33 


9.96 


9.72 


9.55 


9.42 


9.39 


9.21 



For a sample of small size n, the proposed a-level normality test is 

reject normality if q n > Pi- a , 
where P\- a is given in Table 1. 
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4.1.2 Test for S = 



It is of interest to know if 5 = 0, because this simplifies the procedure of inverting the 
quadratic q and arranging (categorizing) the distribution. Hence we consider testing 

the null hypothesis 5 = 0. Theorem 13.1( b) shows that \/ri{5 n — 5) -^4 N(0, Var5 n ), 
as n — > oo, where Var<5 n is the (2, 2) element of matrix D. 

Note that if q in (11. ip is linear (that is, 5 = 0) then X follows either a normal or a 
gamma-type distribution of the form X = cY + d, where Y is gamma and c ^ 0, d are 
constants. In both cases, Var (£ n 1 5 = 0) = (2/3) [l + 13jLt|/4/^| + 7//f/2yu|(4//| + A*D] = 
ctq (5 n ) . Thus, under null hypothesis, 

Z n = VnS n /a (5 n ) -^4 iV(0, 1), as n ->■ oo. 

However, 0"o(5 n ) is unknown, we have to estimate it by Oq^), replacing m^ n by /Xfc. 
Thus, we proposed the statistic 




and, at significance level a, the (asymptotic) rejection region is R = {\z n \ > z a /2}, 
where z a is the upper 100a% point of the standard normal distribution. 

4.1.3 Test for Symmetry 

First we prove the following lemma. 

Lemma 4.1. Let X be a continuous r.v. with mean [i and p.d.f. / satisfying (II. ip . with 
q(x) = 5{x — /i) 2 + /3(x — fi) + 7. Then, X is symmetric if and only if (3 = 0. 

Proof: Let X be symmetric. In this case \x = EX is the point of symmetry. Taking 
derivatives in (II. ip we get (x — /i)/(x) = q'(x)f(x) + q(x)f'(x) for all x G J(X). 
Since fi G J(X), (/x — fi)f(fi) = 5 , '(/ x )/(/ i ) + Q.{^)f{^)- By the symmetry of X, 
f'(fjt) = 0. So Q''(/i)/(/i) = and since f(x) > for all x G J(X) we get /(/x) > and 
q'{n) = 25(fi -fi) + P = P = 0. 

Conversely, let j3 = 0. Then Y = X — /x has /xy = = and its density satisfies (II. ip . 
with qy{y) = 5y 2 + 7. We only have to show that Y is symmetric at 0. The r.v. — Y 
has /x_y = and its density satisfies (11.11) . with q_ Y (y) = qyi—y) = Qy(u)- That is, 

they have the same mean and the same Pearson quadratic form. Hence, — Y = Y. □ 

We note that if / satisfying (II. ip where 5 > then E|X| a < 00 for some a > 1 
if and only if a < 1 + 1/5, see [2]. Therefore, (3 = /x 3 = and a test of 
symmetry is equivalent to testing H : /x 3 = vs Hi : /i 3 7^ 0. From the proof 

of Theorem 13.11 (b) we have that y/n(m 3 . n — fi 3 y -^4 iV(0, <7^ ), as n — > 00, where 
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a m 3 ;n = - 6/i 4 yU2 - A*| + 9f4- Tmi s, under null hypothesis, Z n = y/nm 3 . n /a m3 . n - -^4 
iV(0, 1), as n — > oo, where (since /i 2 r+i = 0) . = /i 6 — 6 / u 4/ u 2 + 9/x|. Since c^ 3 . n .o is 

unknown, we estimate it by c^j 3 . n .o> replacing m 2r;n by /i 2r , and the proposed statistic 
is 

r- m 3 . n 
'n , 



with asymptotic rejection region R = {\z n \ > z a / 2 ], at significance level a. 

4.2 Discrete Case 

4.2.1 Test for Poisson Distribution 

An integer- valued r.v. X with p.m.f. satisfying (II. 2p . follows a Poisson distribution if 
and only if 5, P and a 2 — fi equal to zero. Consider the test 

H : 5 = (3 = a 2 — fj, = vs Hi : At least one of 5 or or a 2 — is non-zero. 

Let a function r — (n, t 2 , Ta)* : R 4 — )■ R 3 with Ti(x) = ^ i+ i(cc), i = 1,2 (where <£>j + i 
as in proof of Theorem 13.1( a)) and 73(21) = x 2 — xi- Note that r(i?) = (5, (3, a 2 — /i)* 
and r(T n ) = (5 n , f3 n , m 2 . n — where 5 n , f3 n the estimators in Theorem 12.1( a). 

Therefore, a test for Poisson is reduced to the null hypothesis r(i?) = 0. Using the 

delta-method, y/E(r(T n ) -t(0)) ^> iV 3 (0,D T ), asn -> oo, where D T = C(tf )$(?*(#), 
with (7(1?) = J T (-d)J^(fj,) and J T ($) = (dri/dxj) | _„. The first two rows of matrix 
J T (i9) are the rows two and three of matrix J^ft) in Theorem 13.1( a). the third row of 
this matrix is (—1, 1, 0, 0). Thus, under null hypothesis, 

Qn = n(5 n ,0 n ,m 2 ;n ~ X n )l])~^(5 n , j3 n , m 2;n ~ X n ) 1 -^4 xjj, as ^ oo. 

where D r .n = ttt f sa - 9 9A 2 - 2 a i2°a 2 ^ and A is the parameter of the Poisson distribution. 

6 H o i2A 2 mV 

Since D T;0 is unknown, we estimate it by P T; o, replacing A by X n , and the proposed 
statistic is 

Qn = n(6 n , f} n , m 2 . n - X n )B T . (6 n , (3 n , m 2 . n - X n ) , 
with asymptotic rejection region R = |g n > xi- a }j a ^ significance level a. 



5 Simulation Results 

Continuous case Using Matlab, we simulated 10 4 samples of size n from various 
continuous distributions admitting a Pearson quadratic form (see Figure 1). 

In Tables 2-10 we present the observed averages of the estimators 5 n , f3 n and 7„ 
(in Theorem 12.1( b)). their observed mean square errors (MSEs) and the empirical 
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level of rejection for the test of 5 = for a nominal significance level of a = 0.05. 
Considering the content of these Tables, we see that, as n becomes large, the means of 
the estimators tend to their true values. It would appear that the estimators of S and 
7 have negative and positive bias, respectively. Also, as expected, the mean square 
error of these estimators tends to zero, as n tends to infinity. 

There exists a well-established literature addressing the problem of testing univari- 
ate data for normality. Renewed recent interest in this inferential problem can be found 
in [2TJ [21], see also [13]. In the light of the findings presented in those papers, we con- 
ducted a simulation study designed to compare the performance of the new proposed 
test of normality with those of the following competitive tests for a nominal significance 
level of 5%, see Figures 2(a)-10(a). The description of each test is preceded by the 
abbreviation we will use when referring to it: 

KS. The (one-sided) empirical distribution function (e.d.f.)-based test of Kolmo- 
gorov-Smirnov. 

BS. The test of Bowman and Shenton (see [1]), or the (one-sided) Jarque and 
Bera (see [12]), the test statistic of which is a function of the coefficients of 
skewness and kurtosis. 

D. The (two-sided) test of D' Agostino (see [7J [8]). Up to a constant, the test 
statistic is the ratio of Downton's (see [9]) linear estimator of the standard 
deviation to the sample standard deviation. The critical values for this test 
are given in D' Agostino's papers. 

AD. The (one-sided) empirical distribution function (e.d.f.)-based test of Ander- 
son and Darling (see [3] ) . We used the corrected critical values for this test 
presented under the name CMWS in Table 2 of see [2T] . 

CvM. The (one-sided) Cramer- von Mises e.d.f. -based test with statistic identified 
as CMS in [21]. We used the corrected critical values given in their Table 2. 

Za-i Zq- The (one-sided) nonparametric likelihood-ratio-based tests with test statis- 
tics Za and Z c of Zhang and Wu (see [23])- We used the corrected critical 
values for these tests given in Tables 1 and 2, respectively, of that paper. 

We will compare the size and power performance of the proposed symmetry test 
(again, for a nominal significance level of 0.05) with that of one other general test of 
symmetry, see Figures 2(b)-10(b). This particular test were chosen because they were 
found to perform well in extensive simulation comparisons reported in Cabilio and 
Masaro (1996). That is: 

CM. The test statistic of Cabilio and Masaro (see [5]), is the simple function Sk = 
y/n(X — m) / s, where X, m and s denote the sample mean, median and standard 

deviation (with divisor n), respectively. Under symmetry, Sk N(0,ctq(F)), 
as n — y oo, where <Jq(F) is a constant (depending on the distribution F). The 
critical values for this test are given in [5]. Notice that this dependence is a weak 
point of the test statistic. 



10 



Figure 1 portrays the densities of the variates used to produce the numerical results 
presented in Tables 2-10 and Figures 2-10. 






N(0,1) 



£7(0,1) 



5(5,5) 



B(.2,.2) 



5(2,8) 




exp(l) G(10,l) C7(30,l) G(50, 1) 

Figure 1. Densities of standard normal and various beta and gamma distributions. 

For the normal distribution the parameters n and a 2 do not affect the critical region 
of the tests, so we fix /i = and a 2 = 1. 

Table 2. Mean and mean square error (MSE) for the three estimators calculated from 10 4 random 
samples of size n simulated from the standard normal distribution (i.e. 8 = 0, j3 = 0, 7 = 1). The last 
row gives the empirical size, ps, of the test for 6 = with a nominal significance level of a = 0.05. 



n 


50 


100 


150 


200 


300 


400 


500 




-0.0818 


-0.0417 


-0.0283 


-0.0208 


-0.0144 


-0.0114 


-0.0089 


MSE(S n ) 


0.0188 


0.0077 


0.0049 


0.0035 


0.0023 


0.0018 


0.0014 




-0.0018 


0.0015 


-0.0005 


-0.0002 


0.0013 


-0.0005 


-0.0000 


MSE0 n ) 


0.0336 


0.0160 


0.0103 


0.0077 


0.0051 


0.0038 


0.0030 


% 


1.0611 


1.0290 


1.0199 


1.0169 


1.0106 


1.0099 


1.0070 


MSE{%) 


0.0614 


0.0281 


0.0181 


0.0139 


0.0091 


0.0069 


0.0054 


PS 


0.0778 


0.0616 


0.0577 


0.0516 


0.0516 


0.0476 


0.0504 




50 100 150 200 300 
(a) 



500 n 




50 100 150 200 300 
(b) 



100 



500 n 



Figure 2. Empirical type I error calculated from 10 4 random samples of size n simulated from the 
standard normal distribution, for (a) tests for normality and (b) tests for symmetry with a nominal 
size of 0.05. The results in (a) correspond to the proposed new test (•) and the seven existing tests KS 
(□), BS (■), D (o), AD (a), CvM (a), Z a (©), Z c (♦). The results in (b) correspond to the proposed 
new test (•), and the CM test (o). 

From Table 2, the test for 5 = maintains the nominal level well for a sample size of 
n > 200 and is liberal for n < 150. 

From Figure 2, the tests for normality hold the nominal level well, apart from the BS 
test which is conservative. As the sample size n increases the empirical sizes of the 
tests for symmetry tend to the nominal level of a = 0.05, as expected. The CM test for 
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symmetry maintains the nominal level better than the new proposed test for symmetry 
which is conservative. 

Next we simulate data from beta and gamma distributions; clearly, they satisfy 

Let X be beta distribution with parameters a and b. The r.v. X is symmetric if 
and only if a = b. Also, for some values of a and b the density function of X is close 
to some normal distribution, for example see Figure 4.1. For various values of a, b we 
have: 

Table 3. Mean and mean square error (MSE) for the three estimators calculated from 10 4 random 
samples of size n simulated from the standard uniform distribution (i.e. S = —0.5, /3 = 0, 7 = 0.125). 
The last row gives the empirical power, p* s , of the test for 6 = with a nominal significance level of 
a = 0.05. 



n 


50 


100 


150 


200 


300 


400 


500 


Sn 


-0.5365 


-0.5170 


-0.5114 


-0.5093 


-0.5054 


-0.5039 


-0.5037 


MSE(S n ) 


0.0349 


0.0153 


0.0101 


0.0078 


0.0049 


0.0036 


0.0029 


X 


0.0001 


-0.0004 


-0.0002 


0.0006 


-0.0001 


-0.0003 


-0.0001 


MSE0 n ) 


0.0038 


0.0018 


0.0012 


0.0009 


0.0006 


0.0004 


0.0003 


In 


0.1271 


0.1259 


0.1256 


0.1256 


0.1253 


0.1252 


0.1252 


MSE{%) 


0.0009 


0.0004 


0.0003 


0.0002 


0.0001 


0.0001 


0.0001 


P*8 


0.9712 


0.9997 


1.0000 


1.0000 


1.0000 


1.0000 


1.0000 



1 

.75 



g .50 

a 



.25 



i'/r} 
1 1 j 

l! ' 

A 1 , 



O 

H -or 

CD 

7 .05 

CD 

a 

j?.03 



100 



500 n 



■50 100 150 200 300 400 500 n 50 100 150 200 300 

(a) (b) 
Figure 3. Empirical power of the tests for normality (a) and the size of tests for symmetry (b), for 

a nominal size of 0.05, calculated from 10 random samples of size n simulated from the standard 

uniform distribution. The tests are the same as those described in the caption to Figure 2. 



From Table 3, as the sample size n increases, the power of the test for S 
to one rapidly. 



increases 



From Figure 3, as the sample size n increases, the power of the tests for normality 
increases to one and the empirical size of the tests for symmetry tend to the nominal 
level of a = 0.05. For n = 50 the proposed test for normality is more powerful than 
the other tests for normality. The tests BS (for a sample size of n — 50) and KS 
(for a sample size of n < 300) have a poor performance. The new proposed test for 
symmetry maintains the nominal level better than the CM test for symmetry which is 
conservative. 
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Table 4. Mean and mean square error (MSE) for the three estimators calculated from 10 4 random 
samples of size n simulated from the beta distribution with a = b = 5 (i.e. 5 = —0.1, f3 = 0, 7 = 0.025). 
The last row gives the empirical power, of the test for 5 — with a nominal significance level of 
a = 0.05. 



n 


50 


100 


150 


200 


300 


400 


500 


s n 


-0.1599 


-0.1274 


-0.1190 


-0.1141 


-0.1086 


-0.1060 


-0.1052 


MSE{S n ) 


0.0169 


0.0066 


0.0041 


0.0030 


0.0019 


0.0014 


0.0011 




-0.0001 


-0.0000 


0.0001 


0.0001 


-0.0001 


0.0000 


-0.0000 


MSE0 n ) 


0.0006 


0.0003 


0.0002 


0.0001 


0.0001 


0.0001 


0.0001 


In 


0.0259 


0.0254 


0.0253 


0.0253 


0.0251 


0.0251 


0.0251 


MSE(%) 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


P*s 


0.2073 


0.2812 


0.3762 


0.4574 


0.6193 


0.7560 


0.8368 




50 100 150 200 300 400 500 n 50 100 150 200 300 400 500 n 



(a) (b) 
Figure 4. Empirical power of the tests for normality (a) and the size of tests for symmetry (b), 

for a nominal size of 0.05, calculated from 10 4 random samples of size n simulated from the beta 

distribution with a = b = 5. The tests are the same as those described in the caption to Figure 2. 

From Table 4, as the sample size n increases, the power of the test for 5 = increases. 
That test for a sample size of n < 100 has a poor performance. 

From Figure 4, as the sample size n increases, the power of the tests for normality 
increases. The new proposed and D tests for normality are more powerful than the 
other tests for normality. All tests (new proposed for normality and D for a sample 
size of n < 200; Za and Zc for a sample size of n < 300; BS, KS and AD for a sample 
size of n < 500) have a poor performance. This happens because the density of beta 
distribution 5(5,5) is close to the normal distribution iV(l/2, 1/44) (see Figure 4.1). 
Notice that the power of the KS test is less than the nominal level for a sample size of 
n < 500. Both tests for symmetry maintain the nominal level equally well. Both are 
conservative. 




0.5 1 



Figure 4.1. Densities of the beta distribution with parameters a = 5, (3 = 5 (solid line) and the 
normal distribution with parameters /i = 1/2, a 2 = 1/44 (dashed line). 
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Table 5. Mean and mean square error (MSE) for the three estimators calculated from 10 4 random 
samples of size n simulated from the beta distribution with a = b = 0.2 (i.e. 5 = —2.5, ft = 0, 
7 = 0.625). The last row gives the empirical power, of the test for 5 = with a nominal 
significance level of a = 0.05. 



n 


50 


100 


150 


200 


300 


400 


500 


Sn 


-2.6784 


-2.5798 


-2.5523 


-2.5310 


-2.5209 


-2.5220 


-2.5177 


MSE{8 n ) 


0.9962 


0.3920 


0.2377 


0.1693 


0.1097 


0.0861 


0.0678 




0.0009 


-0.0010 


0.0005 


-0.0001 


0.0010 


-0.0015 


0.0003 


MSE(X) 


0.1266 


0.0568 


0.0358 


0.0261 


0.0172 


0.0129 


0.0104 


In 


0.6545 


0.6376 


0.6332 


0.6293 


0.6280 


0.6287 


0.6280 


MSE{%) 


0.0505 


0.0203 


0.0124 


0.0088 


0.0058 


0.0045 


0.0035 


1= 

Ps 


1.0000 


1.0000 


1.0000 


1.0000 


1.0000 


1.0000 


1.0000 




50 100 150 200 300 400 500 n 50 100 150 200 300 400 500 n 

(a) (b) 

Figure 5. Empirical power of the tests for normality (a) and the size of tests for symmetry (b), 

for a nominal size of 0.05, calculated from 10 4 random samples of size n simulated from the beta 

distribution with a = b = 0.2. The tests are the same as those described in the caption to Figure 2. 

From Table 5, for each sample size n the power of the test for 5 = is one. 

From Figure 5, as the sample size n increases, the power of the tests for normality 
increases to one and the empirical size of the tests for symmetry tend to the nominal 
level of a = 0.05. The test D for a sample size of n < 150 has a poor performance. The 
new proposed test for symmetry maintains the nominal level better than the CM test 
for symmetry. However, the new test is liberal whereas the CM test is conservative. 

Table 6. Mean and mean square error (MSE) for the three estimators calculated from 10 4 random 
samples of size n simulated from the beta distribution with a = 2,6 = 8 (i.e. S = —0.1, /3 = 0.06, 
7 = 0.016). The last row gives the empirical power, p* & , of the test for <5 = with a nominal significance 
level of a = 0.05. 



n 


50 


100 


150 


200 


300 


400 


500 


Sn 


-0.2052 


-0.1552 


-0.1385 


-0.1284 


-0.1204 


-0.1158 


-0.1116 


MSE(6 n ) 


0.0298 


0.0128 


0.0082 


0.0059 


0.0040 


0.0031 


0.0024 


X 


0.0622 


0.0617 


0.0610 


0.0609 


0.0606 


0.0604 


0.0603 


MSE(X) 


0.0009 


0.0004 


0.0002 


0.0002 


0.0001 


0.0001 


0.0001 


% 


0.0172 


0.0167 


0.0165 


0.0164 


0.0162 


0.0162 


0.0161 


MSE{%) 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


0.0000 


P*s 


0.1511 


0.1606 


0.1678 


0.1909 


0.2403 


0.2943 


0.3520 



From Table 6, as the sample size n increases, the power of the test for 5 = increases. 
That test for each sample size of n has a poor performance. 
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50 100 150 200 300 400 500 n 50 100 150 200 300 400 500 n 



(a) (b) 

Figure 6. Empirical power of the tests for normality (a) and symmetry (b), for a nominal size of 0.05, 
calculated from 10 4 random samples of size n simulated from the beta distribution with a = 2, b = 8. 
The tests are the same as those described in the caption to Figure 2. 

From Figure 6, as the sample size n increases, the power of the tests for normality 
and for symmetry increases to one. For n = 50 the Za test is more powerful than the 
other tests for normality. The tests KS, D (for a sample size of n < 200) and BS (for 
a sample size of n — 50) have a poor performance. The power of the KS test is less 
than the nominal level for a sample size of n < 100. The new proposed test is more 
powerful than the CM test for symmetry. 

Let X be gamma distribution with parameters a and 9. The r.v. X is asymmetric 
for all a and 9 (notice that the test of symmetry of Cabilio and Masaro cannot be 
used). For large values of a, via Central Limit Theorem, the density function of X is 
close to a normal distribution. 

The parameter 9 does not affect the critical region of the tests (so we fix 9 = 1), in 
contrast to the parameter a. In order to investigate the level of rejection of the tests, 
as a increases, we choose to simulate the gamma distribution for 9 = 1 and a = 1, 10, 
30, 50. 

Table 7. Mean and mean square error (MSE) for the three estimators calculated from 10 4 random 
samples of size n simulated from the exponential distribution with 6 — 1 (i.e. 5 = 0, /3 = 1, 7 = 1). 
The last row gives the empirical size, p$, of the test for (5 = with a nominal significance level of 
a = 0.05. 



n 


50 


100 


150 


200 


300 


400 


500 


Sn 


-0.2857 


-0.1855 


-0.1453 


-0.1201 


-0.0910 


-0.0753 


-0.0650 


MSE(S n ) 


0.1131 


0.0521 


0.0349 


0.0258 


0.0172 


0.0133 


0.0110 




1.2820 


1.1912 


1.1599 


1.1371 


1.1128 


1.0938 


1.0820 


MSE0 n ) 


0.5744 


0.2271 


0.1433 


0.1021 


0.0637 


0.0450 


0.0342 


In 


1.2755 


1.1689 


1.1314 


1.1105 


1.0853 


1.0696 


1.0613 


MSE{%) 


0.3895 


0.1510 


0.0936 


0.0674 


0.0419 


0.0309 


0.0243 


PS 


0.0717 


0.0385 


0.0266 


0.0212 


0.0170 


0.0132 


0.0122 



From Table 7, the test for 5 = is liberal for a sample size of n = 50 and is conservative 
for n > 100. 

From Figure 7, as the sample size n increases, the power of the tests for normality and 
of the new proposed test for symmetry increases to one. The test KS for a sample size 
of n = 50 has a poor performance. 
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Figure 7. Empirical power of the tests for normality (a) and symmetry (b), for a nominal size of 
0.05, calculated from 10 random samples of size n simulated from the exponential distribution with 
9 = 1. The tests are the same as those described in the caption to Figure 2. 

Table 8. Mean and mean square error (MSE) for the three estimators calculated from 10 4 random 
samples of size n simulated from the gamma distribution with a = 10,9 = 1 (i.e. S = 0, (3 = 1, 
7 = 10). The last row gives the empirical size, p$, of the test for 6 = with a nominal significance 
level of a = 0.05. 



n 


50 


100 


150 


200 


300 


400 


500 




-0.1075 


-0.0618 


-0.0436 


-0.0348 


-0.0259 


-0.0194 


-0.0153 


MSE(6 n ) 


0.0251 


0.0113 


0.0071 


0.0055 


0.0038 


0.0029 


0.0023 




1.0292 


1.0202 


1.0140 


1.0085 


0.9986 


1.0065 


1.0042 


MSE0 n ) 


0.5144 


0.2239 


0.1483 


0.1053 


0.0681 


0.0510 


0.0405 


In 


10.8993 


10.5233 


10.3646 


10.2952 


10.1970 


10.1635 


10.1317 


MSE{%) 


8.6087 


3.8083 


2.5283 


1.8026 


1.2040 


0.9187 


0.7057 


PS 


0.0808 


0.0582 


0.0478 


0.0433 


0.0384 


0.0322 


0.0357 
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Figure 8. Empirical power of the tests for normality (a) and symmetry (b), for a nominal size 
of 0.05, calculated from 10 4 random samples of size n simulated from the gamma distribution with 
a = 10, 9 = 1. The tests are the same as those described in the caption to Figure 2. 

From Table 8, the test for S = maintains the nominal level well for a sample size of 
n = 150, is liberal for n < 100 and conservative for n > 200. 

From Figure 8, as the sample size n increases, the power of the tests new proposed, BS, 
AD, CvM, Za and Zq for normality and of the new proposed for symmetry increases 
to one; also, the power of the tests KS and D for normality increases. The Za, Zq and 
new proposed tests for normality are more powerful than the other tests for normality 
The power of KS test for a sample size of n < 200 is less than the nominal level. All 
the tests (new proposed for normality, BS, AD, CvM, Za and Zc for a sample size of 
n = 50; D for n < 200; KS for 300 < n < 500; new proposed for symmetry for n = 50) 
have a poor performance. 
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Table 9. Mean and mean square error (MSE) for the three estimators calculated from 10 4 random 
samples of size n simulated from the gamma distribution with a = 30,0 = 1 (i.e. 8 = 0, /3 = 1, 



7 = 30). The last row 


gives the 


empirical 


size, ps, 


of the test for S = 


with a 


nominal 


level of a = 0.05. 
















n 


50 


100 


150 


200 


300 


400 


500 


s n 


-0.0919 


-0.0488 


-0.0336 


-0.0257 


-0.0173 


-0.0146 


-0.0121 


MSE{S n ) 


0.0214 


0.0088 


0.0056 


0.0042 


0.0028 


0.0022 


0.0017 




1.0068 


0.9964 


0.9951 


0.9996 


1.0031 


1.0003 


0.9946 


MSE0 n ) 


1.1458 


0.5417 


0.3446 


0.2526 


0.1655 


0.1241 


0.1025 


In 


32.0871 


31.1106 


30.7382 


30.6222 


30.4259 


30.3779 


30.3113 


MSE(%) 


60.6143 


28.0028 


18.1469 


13.6622 


8.8630 


7.0383 


5.3907 


PS 


0.0773 


0.0620 


0.0477 


0.0486 


0.0413 


0.0415 


0.0373 
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Figure 9. Empirical power of the tests for normality (a) and symmetry (b), for a nominal size 
of 0.05, calculated from 10 4 random samples of size n simulated from the gamma distribution with 
a = 30, 9 = 1. The tests are the same as those described in the caption to Figure 2. 

From Table 9, the test for S = maintains the nominal level well for a sample size of 
150 < n < 200, is liberal for n < 100 and conservative for n > 300. 

From Figure 9, as the sample size n increases, the power of the tests increases. For 
n < 150 the Z& and Zc tests are more powerful than the other tests for normality 
and for n > 300 the new proposed test for normality is more powerful than the other 
tests for normality. The power of KS test for a sample size of n < 500 is less than the 
nominal level. The tests new proposed for normality, Zc (for n < 100), BS (for 
n < 150) AD, CvM (for n < 200), D (for n < 500) and new proposed for symmetry 
(for n < 150) have a poor performance. 

Table 10. Mean and mean square error (MSE) for the three estimators calculated from 10 4 random 
samples of size n simulated from the gamma distribution with a = 50,9 = 1 (i.e. S = 0, /3 = 1, 
7 = 50). The last row gives the empirical size, ps, of the test for 6 = with a nominal significance 
level of a = 0.05. 



n 


50 


100 


150 


200 


300 


400 


500 


Sn 


-0.0873 


-0.0451 


-0.0320 


-0.0238 


-0.0169 


-0.0123 


-0.0098 


MSE{8 n ) 


0.0199 


0.0083 


0.0053 


0.0039 


0.0026 


0.0019 


0.0015 


X 


1.0241 


0.9797 


0.9932 


0.9919 


0.9931 


0.9981 


0.9994 


MSE0 n ) 


1.8321 


0.8343 


0.5527 


0.4124 


0.2649 


0.1994 


0.1610 


In 


53.2245 


51.7797 


51.3292 


50.9261 


50.6445 


50.4826 


50.3935 


MSE{%) 


163.7599 


75.7973 


51.7595 


36.5868 


23.9070 


18.0106 


14.1179 


PS 


0.0746 


0.0542 


0.0492 


0.0504 


0.0462 


0.0432 


0.0449 
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Figure 10. Empirical power of the tests for normality (a) and symmetry (b), for a nominal size 
of 0.05, calculated from 10 4 random samples of size n simulated from the gamma distribution with 
a = 50, 9 = 1. The tests are the same as those described in the caption to Figure 2. 

From Table 10, the test for 5 = maintains the nominal level well for a sample size of 
100 < n < 300, is liberal for n = 50 and conservative for n > 400. 

From Figure 10, as the sample size n increases, the power of the tests increases. For 
n < 150 the Za and Zc tests are more powerful than the other tests for normality 
and for n > 300 the new proposed test for normality is more powerful than the other 
tests for normality. The power of KS test for a sample size of n < 500 is less than 
the nominal level. The tests new proposed for normality, BS, AD, CvM, Za, Zq (for 
n < 200), D (for n < 500) and new proposed for symmetry (for n < 200) have a poor 
performance. 



Discrete case In order to investigate the results of Subsection 14.2} we simulated 10 4 
samples of size n from various Poisson, binomial and negative binomial distributions. 
The Figure 11 shows the empirical size of the proposed test for Poisson for a nominal 
significance level of a = 0.05. The Figures 12 and 13 show the empirical power of this 
test for binomial and negative binomial distributions respectively (again, for a nominal 
significance level of 0.05). 

For the Poisson distribution the parameter A affects the critical region of the test. 
In order to investigate the size of the test, as A increases, we choose to simulate the 
Poisson distribution for A = 1, 10, 30 and 100. 
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Figure 11. Empirical type I error calculated from 10 4 random samples of size n = 50, 100, 150, 200, 
300, 400, 500, 750, 1000 simulated from the Poisson distribution with parameter A = 1,10,30,100 

(lines — , , ■ -) of the test for an underlying Poisson distribution for a nominal significance 

level of a = 0.05. 
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In order to investigate the power of the proposed test for Poisson, we choose to sim- 
ulate various binomial and negative binomial distributions, see Figures 12 and 13 re- 
spectively. Note that all binomial distributions (with parameters N, p) and all negative 
binomial distributions (with parameters r, p, i.e. with p.m.f. f(j) = ( J+r_ )p r (l — pY , 
j = 0,1,...) satisfy ( Ojl . 
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Figure 12. Empirical power calculated from 10 4 random samples of size n = 50, 100, 150, 200, 300, 
400, 500, 750, 1000 simulated from the binomial distribution with parameters N = 10, 30, 100 (lines 

— , , ■ • • ) and p = 0.20,0.35,0.50, 0.65 (points •, o, ■, □) of the test for an underlying Poisson 

distribution for a nominal significance level of a = 0.05. 
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Figure 13. Empirical power calculated from 10 random samples of size n = 50, 100, 150, 200, 300, 
400, 500, 750, 1000 simulated from the negative binomial distribution with parameters r and p, for 
r = 1, 10, 50, 100 (lines — ,---,•••,-•-) and p = 0.60, 0.70, 0.80, 0.90 (points ., o, ., □) of the 
test for an underlying Poisson distribution for a nominal significance level of a = 0.05. The results 
for the cases (r = l,p = 0.9, n < 500), (r = l,p = 0.8, n < 100) and (r = l,p = 0.7, n = 50) are not 
given, since the estimated matrix U> T; o (see Subsection 14. 2 p often is singular in these cases. 

From Figure 11, the size of the proposed test for Poisson is not affected when A > 10. 
Only in the case A = 1 this size is different. In the case A = 1 this test maintains 
the nominal level well for a sample size of 100 < n < 150, is liberal for n = 50 and 
conservative for n > 200. In the cases 10 < A < 100 this test maintains the nominal 
level well for a sample size of n > 150 and is liberal for n < 100. 

From Figure 12, the parameter N does not affect the power of the test, in contrast to 
the parameter p. As p increases, the power of the test increases for each n. Also, as n 
increases, the power of the test increases to one for each p. For the cases (p = 0.2, n < 
200) and (p = 0.35, n = 50) the test has a poor performance. 
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From Figure 13, the parameter r does not affect the power of the test, except in cases 
p = 0.8 and n < 300. Only the parameter p affects the power of the test. As p increases, 
the power of the test decreases for each n. Also, as n increases, the power of the test 
increases for p = 0.9 and increases to one for p < 0.8. For the cases (p = 0.8, n < 150) 
and (p = 0.9, n = 750) the test has a poor performance. 

Notice that we home also simulated negative binomial distributions for p < 0.5, r = 
1, 10, 50, 100 of size n = 50, 100, 150, 200, 300, 400, 500, 750, 1000 and in all cases the 
observed empirical power is one. 

Similar to the continuous case, we can construct tables for the average of the es- 
timators and for their MSEs. From these tables one can see that, as n increases, the 
averages of the estimators tend to their true values and the mean square errors decrease 
to zero. 
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